Why multiprocess behavior differ between v1.12 and v1.9

您所在的位置：网站首页 › subprocess multiprocess › Why multiprocess behavior differ between v1.12 and v1.9

Why multiprocess behavior differ between v1.12 and v1.9

2023-04-05 02:08| 来源: 网络整理| 查看: 265

When I use torch==1.9.0, the following code runs fine.

import torch from multiprocessing import Process import multiprocessing def run(): print('in proc', torch.cuda.is_initialized()) print('in proc', torch.cuda._is_in_bad_fork()) torch.zeros((5,)).cuda() def fk_run(): print('in fork proc', torch.cuda.is_initialized()) print('in fork proc', torch.cuda._is_in_bad_fork()) torch.zeros((5,)).cuda() if __name__ == "__main__": print('in main', torch.cuda.is_initialized()) print('in main', torch.cuda._is_in_bad_fork()) sp_ctx = multiprocessing.get_context('spawn') a = sp_ctx.Process(target=run) a.start() fk_ctx = multiprocessing.get_context('fork') b = fk_ctx.Process(target=fk_run) b.start() a.join() b.join()

However, when run with torch=1.12 get a RuntimeError

RuntimeError: Cannot re-initialize CUDA in forked subprocess. To use CUDA with multiprocessing, you must use the 'spawn' start method

I check the source code of 1.9 and 1.12 and find no difference.

github.com pytorch/pytorch/blob/v1.12.0/torch/csrc/cuda/Module.cpp#L42 #ifndef WIN32 #include #endif using namespace torch; static bool in_bad_fork = false; // True for children forked after cuda init #ifndef WIN32 // Called in the forked child if cuda has already been initialized static void forked_child() { in_bad_fork = true; torch::utils::set_run_yet_variable_to_false(); } #endif // Should be called before the first cuda call. // Note: This is distinct from initExtension because a stub cuda implementation // has some working functions (e.g. device_count) but cannot fully initialize. static void poison_fork() { #ifndef WIN32 github.com pytorch/pytorch/blob/v1.9.0/torch/csrc/cuda/Module.cpp#L37 #include #endif using namespace torch; THCState *state = nullptr; static bool in_bad_fork = false; // True for children forked after cuda init #ifndef WIN32 // Called in the forked child if cuda has already been initialized static void forked_child() { in_bad_fork = true; torch::utils::set_run_yet_variable_to_false(); state = nullptr; } #endif // Should be called before the first cuda call. // Note: This is distinct from initExtension because a stub cuda implementation // has some working functions (e.g. device_count) but cannot fully initialize. static void poison_fork() {

Besides, the demo code doesn’t init CUDA in the main process, why prompts Cannot re-initialize CUDA in forked subprocess ?

【本文地址】

Why multiprocess behavior differ between v1.12 and v1.9

Why multiprocess behavior differ between v1.12 and v1.9

今日新闻

推荐新闻